Abstract

A function $f = f_T$ is called the least energy approximation to a function $B$ on the interval $[0,T]$ with penalty $Q$ if it solves the variational problem

$$\int_0^T \left[ f'(t)^2 + Q(f(t) - B(t)) \right] dt \searrow \min.$$

For quadratic penalty the least energy approximation can be found explicitly. If $B$ is a random process with stationary increments, then on large intervals $f_T$ is also close to a process of the same class, and the relation between the corresponding spectral measures can be found. We show that in the long run (when $T \to \infty$) the expected energy of the optimal approximation per unit of time converges to a limit, which we compute explicitly. For Gaussian and Lévy processes we complement this result with almost sure and $L_1$ convergence.

As an example, the asymptotic expression of approximation energy 
is found for fractional Brownian motion. 
1 Introduction


Least energy approximations play an important role both in pure and applied mathematics. The most important approximation of this kind is known under the name of taut string.

Given a target function $B(\cdot)$ and a nonnegative width function $r(\cdot)$ defined on a time interval $[0,T]$, the taut string is a function $f_T$ providing the minimum of the energy functional

$$\mathcal{E}_T(h) := \int_0^T h'(t)^2\, dt$$

among all absolutely continuous functions $h$ with given starting and final values and satisfying

$$B(t) - r(t) \le h(t) \le B(t) + r(t), \qquad 0 \le t \le T.$$

The same function optimizes (under the same restrictions) the graph length $\int_0^T \sqrt{1 + h'(t)^2}\, dt$, the variation $\int_0^T |h'(t)|\, dt$, and other functionals represented as integrals of a convex function of $h'$.

The taut string is a classical object, well known in Variational Calculus, in Mathematical Statistics, see [?], and in a broad range of applications such as image processing, see [?, Chapter 4, Subsection 4.4], or communication theory, see [12].

For the case when a random function $B(\cdot)$ is approximated, very little information is available. Lifshits and Setterqvist studied in [5] the energy of the taut string accompanying a Wiener process.

Unfortunately the taut string is rather hard to describe and to compute 
explicitly. Therefore, the study of other least energy approximations makes 
sense. One possible alternative is to replace the rigid boundary restrictions 
by introducing some penalty function that controls the deviation from the 
target function. 

A function $f = f_T$ is called the least energy approximation to a function $B$ on the interval $[0,T]$ with penalty $Q$ if it solves the variational problem

$$\int_0^T \left[ f'(t)^2 + Q(f(t) - B(t)) \right] dt \searrow \min.$$

Notice that this approach is very much in the spirit of interpolation theory from functional analysis. The classical taut string can be formally included in this setting by using the time-inhomogeneous penalty

$$Q(x,t) := \begin{cases} 0, & |x| \le r(t), \\ +\infty, & |x| > r(t). \end{cases}$$

One of the most natural choices for the penalty is the quadratic penalty $Q(y) = y^2$, where one can advance sufficiently far with explicit calculations.
Figure 1: The least energy approximation of a centered Poisson process.


For quadratic penalty the least energy approximation can be found explicitly. We study its behavior in the long run (when $T \to \infty$) and show that, under a weak assumption on $B$, it converges to a limit at an exponential rate.

In Section 2 we provide the necessary exact and asymptotic formulas for the least energy approximation of a deterministic function. In Section 3 the central results of the article, related to the approximation of a random process with stationary increments, are obtained. If $B$ is a random process with stationary increments, then on large intervals its least energy approximation (cf. Figure 1) is also close to a process of the same class, and the relation between the corresponding spectral measures can be found. We show that in the long run (when $T \to \infty$) the expected energy of the optimal approximation per unit of time converges to a limit. For Gaussian and Lévy processes we complement this result with almost sure and $L_1$ convergence. As an example, the asymptotic expression of the approximation energy is found for fractional Brownian motion. In view of the importance of the Wiener process, we propose an alternative approach to its least energy approximation in Section 4.

Finally, we note that our results can be considered as a complement to those of traditional stochastic control theory, where the best approximating function is chosen among the adapted ones, i.e. its value at time $t$ must be determined by the values of the target function $B(s)$, $s \le t$. The adaptive approach is more realistic, but it leads to problems solvable mainly for Markov target processes (see e.g. [?, ?] for the least energy approximation of the Wiener process). One may consider our results as lower bounds for the least energy achievable by adaptive control when the latter is unknown, or as an evaluation of the price to be paid for not knowing the future when both the optimal adaptive and non-adaptive least energy approximations are known.

2 Least energy approximation: deterministic setting

2.1 Approximation on a fixed interval 

The following variational problem (which is our starting point) and its solution are quite standard facts from variational calculus. We provide the proof for the sake of completeness but postpone it to the end of the article.

Let $B(\cdot)$ be a fixed deterministic function on an interval $[0,T]$. In this section we deal with the minimization problem for the functional

$$\mathcal{E}_T(f) := \int_0^T \left[ f'(t)^2 + Q(f(t) - B(t)) \right] dt$$

over the Sobolev space $W_2^1[0,T]$ of all absolutely continuous functions $f : [0,T] \to \mathbb{R}$ having square integrable derivative. Here $Q(\cdot)$ is an appropriate penalty function.

Proposition 2.1 Let $B(\cdot)$ be a measurable function on $[0,T]$ and let $Q(\cdot)$ be a strictly convex non-negative differentiable function on $\mathbb{R}$ such that

$$\lim_{x \to \pm\infty} Q(x) = +\infty.$$

Assume that either $B$ is bounded or for some $A > 0$, $p > 0$ it is true that $B \in L_{p+1}[0,T]$ and

$$|Q'(x)| \le A(|x|^p + 1), \qquad x \in \mathbb{R}. \qquad (1)$$

Then there exists a unique solution $f_T$ of the problem

$$\mathcal{E}_T(f) \searrow \min, \qquad f \in W_2^1[0,T]. \qquad (2)$$

This solution has an absolutely continuous first derivative $f_T'(\cdot)$ and satisfies the equation

$$2 f_T''(t) = Q'(f_T(t) - B(t)) \quad \text{for a.e. } t \in (0,T),$$

and the boundary conditions $(f_T)'_+(0) = (f_T)'_-(T) = 0$.

Remark 2.2 In the following we will apply this proposition to random functions $B$ that are not necessarily bounded but belong to $L_2[0,T]$ almost surely. Therefore, assumption (1) is adequate.

For the rest of the paper, we restrict attention to the quadratic penalty $Q(y) = y^2$, because in this case we are able to obtain explicit and quite meaningful results.


Proposition 2.3 Let $B \in L_2[0,T]$. The solution of the minimization problem with quadratic penalty

$$\mathcal{E}_T(f) := \int_0^T \left[ f'(t)^2 + |f(t) - B(t)|^2 \right] dt \searrow \min$$

is given for $0 \le t \le T$ by

$$f_T(t) = -\int_0^t B(s)\sinh(t-s)\,ds + \frac{\int_0^T B(s)\cosh(T-s)\,ds}{e^T - e^{-T}}\,(e^t + e^{-t}). \qquad (3)$$

Moreover, for $0 \le t \le T$ we have

$$f_T'(t) = -\int_0^t B(s)\cosh(t-s)\,ds + \frac{\int_0^T B(s)\cosh(T-s)\,ds}{e^T - e^{-T}}\,(e^t - e^{-t}). \qquad (4)$$


Proof: From Proposition 2.1 we know that the minimizer $f_T$ exists, is unique, and solves the differential equation

$$f''(t) = f(t) - B(t) \qquad (5)$$

with boundary conditions

$$f'(0) = f'(T) = 0.$$

The general form of the solution of the linear equation (5) without boundary conditions is

$$f(t) = f_*(t) + C_1 e^t + C_2 e^{-t},$$

where $f_*$ is any fixed solution of (5).

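Let us check that

$$f_*(t) := -\int_0^t B(s)\sinh(t-s)\,ds = \frac12\left[ e^{-t}\int_0^t B(s)e^{s}\,ds - e^{t}\int_0^t B(s)e^{-s}\,ds \right]$$

indeed provides a solution of (5). We have

$$f_*'(t) = \frac12\left[ -e^{-t}\int_0^t B(s)e^{s}\,ds + e^{-t}B(t)e^{t} - e^{t}\int_0^t B(s)e^{-s}\,ds - e^{t}B(t)e^{-t} \right]$$
$$= -\frac12\left[ e^{-t}\int_0^t B(s)e^{s}\,ds + e^{t}\int_0^t B(s)e^{-s}\,ds \right] = -\int_0^t B(s)\cosh(t-s)\,ds. \qquad (6)$$

It follows that

$$f_*''(t) = -\frac12\left[ -e^{-t}\int_0^t B(s)e^{s}\,ds + e^{-t}B(t)e^{t} + e^{t}\int_0^t B(s)e^{-s}\,ds + e^{t}B(t)e^{-t} \right]$$
$$= \frac12\left[ e^{-t}\int_0^t B(s)e^{s}\,ds - e^{t}\int_0^t B(s)e^{-s}\,ds - 2B(t) \right] = f_*(t) - B(t),$$

and we see that (5) holds.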

Next, we adjust the coefficients $C_1$ and $C_2$ by using the boundary conditions. From

$$0 = f'(0) = (f_*)'_+(0) + C_1 - C_2 = C_1 - C_2$$

we derive $C_1 = C_2$. From

$$0 = f'(T) = (f_*)'_-(T) + C_1 e^{T} - C_2 e^{-T} = (f_*)'_-(T) + C_1 (e^{T} - e^{-T}),$$

by using (6) we obtain

$$C_1 = \frac{-(f_*)'_-(T)}{e^{T} - e^{-T}} = \frac{\int_0^T B(s)\cosh(T-s)\,ds}{e^{T} - e^{-T}}$$

and arrive at formulas (3) and (4). □
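The closed form (3) can be checked numerically. The following is a minimal sketch, not part of the original article: the function name `least_energy_approx`, the uniform grid, and the trapezoidal rule are illustrative assumptions. It evaluates (3) through the two running integrals of $B(s)e^{s}$ and $B(s)e^{-s}$ used in the proof, and compares the result for the test target $B(s) = s$ with the solution $t + (e^{T-t} - e^{t})/(e^{T}+1)$ of $f'' = f - t$, $f'(0) = f'(T) = 0$, obtained by hand.

```python
import math

def least_energy_approx(B, T, n=1000):
    """Evaluate formula (3) on the uniform grid t_i = i*T/n (hypothetical helper)."""
    h = T / n
    t = [i * h for i in range(n + 1)]
    b = [B(s) for s in t]
    # Running trapezoidal integrals of B(s)e^{s} and B(s)e^{-s} over [0, t_i].
    plus, minus = [0.0], [0.0]
    for i in range(1, n + 1):
        plus.append(plus[-1] + 0.5 * h * (b[i - 1] * math.exp(t[i - 1]) + b[i] * math.exp(t[i])))
        minus.append(minus[-1] + 0.5 * h * (b[i - 1] * math.exp(-t[i - 1]) + b[i] * math.exp(-t[i])))
    # int_0^T B(s) cosh(T - s) ds = (e^{T} * minus[n] + e^{-T} * plus[n]) / 2
    cosh_int = 0.5 * (math.exp(T) * minus[n] + math.exp(-T) * plus[n])
    c1 = cosh_int / (math.exp(T) - math.exp(-T))
    f = []
    for i in range(n + 1):
        # f_*(t) = (e^{-t} int_0^t B e^{s} ds - e^{t} int_0^t B e^{-s} ds) / 2
        f_star = 0.5 * (math.exp(-t[i]) * plus[i] - math.exp(t[i]) * minus[i])
        f.append(f_star + c1 * (math.exp(t[i]) + math.exp(-t[i])))
    return t, f

# Test target B(s) = s on [0, 4]; the exact minimizer was derived by hand.
T = 4.0
t, f_num = least_energy_approx(lambda s: s, T)
f_exact = [x + (math.exp(T - x) - math.exp(x)) / (math.exp(T) + 1.0) for x in t]
max_err = max(abs(a - b) for a, b in zip(f_num, f_exact))
print("max deviation from the exact minimizer:", max_err)
```

The deviation is governed by the quadrature error only, so it should shrink as `n` grows.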


2.2 Approximation in a long run 

We now study the behavior of the least energy approximation in the long run, i.e. when the subject of approximation, the function $B(\cdot)$, is fixed while the interval length $T$ goes to infinity.

In view of future applications it will be more convenient for us to let $B(\cdot)$ be defined on the entire real line. Although the approximation problem involves only the restriction of $B$ to the positive half-line, if necessary, one can always extend $B$ to the negative half-line by assigning it zero values.

Recall that the function $f_T(\cdot)$ defined by formula (3) provides the least energy approximation to $B(\cdot)$ on the interval $[0,T]$.

We first derive a simple approximate heuristic for $f_T$, then transform this heuristic into a rigorous result.

2.2.1 Heuristics 

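We will simplify the expressions (3) and (4) as follows. Assume that $t, T, T-t, T-s \to +\infty$ and drop all small exponential terms $e^{-t}$, $e^{-T}$, where appropriate.

In particular we let

$$\frac{\cosh(T-s)\,(e^{t} + e^{-t})}{e^{T} - e^{-T}} \approx \frac12 (e^{T-s} + e^{s-T})\,e^{t-T} = \frac12 (e^{t-s} + e^{s+t-2T}) = \frac12 (e^{t-s} + e^{s-t-2(T-t)}) \approx \frac12\, e^{t-s}.$$

By plugging this into (3) and (4), we get

$$f_T(t) \approx \int_0^t B(s)\left( \sinh(s-t) + \frac{e^{t-s}}{2} \right) ds + \frac12 \int_t^T B(s)\, e^{t-s}\,ds$$
$$= \frac12 \int_0^t B(s)\, e^{s-t}\,ds + \frac12 \int_t^T B(s)\, e^{t-s}\,ds = \frac12 \int_0^T B(s)\, e^{-|s-t|}\,ds.$$

Similarly,

$$f_T'(t) \approx \frac12 \int_0^T B(s)\,\mathrm{sgn}(s-t)\, e^{-|s-t|}\,ds.$$

Assuming that the function $B(\cdot)$ is defined on the entire $\mathbb{R}$, it is more natural to use approximations based on "stationary" kernels, i.e. $f_T(t) \approx \widehat f(t)$ and $f_T'(t) \approx \widehat f'(t)$, where

$$\widehat f(t) := \frac12 \int_{-\infty}^{\infty} B(s)\, e^{-|s-t|}\,ds \qquad (7)$$

and

$$\widehat f'(t) = \frac12 \int_{-\infty}^{\infty} B(s)\,\mathrm{sgn}(s-t)\, e^{-|s-t|}\,ds. \qquad (8)$$

Notice that the approximations $\widehat f$ and $\widehat f'$ do not depend on $T$. This shows the local nature of the least energy approximation in the long run.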

2.2.2 Rigorous result 

The following result shows that the least energy approximation is exponentially close to its stationary approximation at the bulk values of time.

Proposition 2.4 Assume that

$$|B(s)| \le C(|s| + 1)^p, \qquad s \in \mathbb{R}, \qquad (9)$$

for some $C, p > 0$. Let $f_T$ be the least energy approximation given by (3) and let the approximation $\widehat f$ be given by (7).

Then for all $T \ge 1$ and all $t \in [0,T]$ we have

$$|f_T(t) - \widehat f(t)| \le C A_p (T+1)^p \left( e^{-t} + e^{-T} + e^{-(T-t)} \right), \qquad (10)$$

and

$$|f_T'(t) - \widehat f'(t)| \le C A_p (T+1)^p \left( e^{-t} + e^{-T} + e^{-(T-t)} \right), \qquad (11)$$

where the constant $A_p$ depends only on $p$.
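The closeness stated in (10) can be illustrated on a bounded test target. The sketch below is not from the original article; it assumes $B(s) = \sin s$ (chosen for illustration), for which the stationary approximation (7) equals $\sin(t)/2$ and the minimizer on $[0,T]$ can be written by hand as $\sin(t)/2 + C_1 e^{t} + C_2 e^{-t}$. The helper names `stationary_approx` and `exact_f_T_sine` are hypothetical.

```python
import math

def stationary_approx(B, t, half_width=30.0, h=0.01):
    """Trapezoidal approximation of (7): 0.5 * int B(s) exp(-|s - t|) ds,
    truncated to [t - half_width, t + half_width] (the tails are negligible)."""
    m = int(round(half_width / h))
    total = 0.0
    for k in range(-m, m + 1):
        w = 0.5 if abs(k) == m else 1.0  # end-point weights of the trapezoid rule
        v = k * h
        total += w * B(t + v) * math.exp(-abs(v))
    return 0.5 * h * total

def exact_f_T_sine(t, T):
    """Hand-derived solution of f'' = f - sin, f'(0) = f'(T) = 0."""
    c1 = (math.exp(-T) - math.cos(T)) / (2.0 * (math.exp(T) - math.exp(-T)))
    c2 = c1 + 0.5
    return 0.5 * math.sin(t) + c1 * math.exp(t) + c2 * math.exp(-t)

T = 20.0
for t in (0.0, 5.0, 10.0, 15.0, 20.0):
    deviation = abs(exact_f_T_sine(t, T) - 0.5 * math.sin(t))
    envelope = math.exp(-t) + math.exp(-(T - t))
    print(t, deviation, envelope)
```

In this example the deviation stays below $e^{-t} + e^{-(T-t)}$ at every sampled time and is of order $e^{-T/2}$ in the middle of the interval, which is the behavior (10) describes for $p = 0$.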
Remark 2.5 Since we aim at maximal generality in what concerns $B$, we will not use this proposition directly in the stochastic part of the article because of the weak but still slightly restrictive assumption (9). Instead, we will use the elements of its proof later on.


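Proof of Proposition 2.4. Using the definitions (3) and (7) we see that

$$|f_T(t) - \widehat f(t)| \le I_1 + I_2 + I_3,$$

where

$$I_1 := \frac12 \int_{-\infty}^{0} |B(s)|\, e^{-(t-s)}\,ds,$$
$$I_2 := \int_0^T |B(s)|\, \left| K_T(s,t) - e^{t-s}/2 \right| ds,$$
$$I_3 := \frac12 \int_T^{\infty} |B(s)|\, e^{-(s-t)}\,ds,$$

and

$$K_T(s,t) := \frac{\cosh(T-s)\,(e^{t} + e^{-t})}{e^{T} - e^{-T}} = \frac{e^{t-s}}{2}\left(1 + e^{-2(T-s)}\right)\left(1 + e^{-2t}\right)\left(1 - e^{-2T}\right)^{-1}.$$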


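By (9) we have

$$I_1 \le \frac{C e^{-t}}{2} \int_{-\infty}^{0} (|s| + 1)^p\, e^{s}\,ds =: C A_p^{(1)} e^{-t}$$

and

$$I_3 \le \frac{C e^{-(T-t)}}{2} \int_T^{\infty} (s+1)^p\, e^{-(s-T)}\,ds = \frac{C e^{-(T-t)}}{2} \int_0^{\infty} (T + 1 + u)^p\, e^{-u}\,du$$
$$\le \frac{C\, 2^p e^{-(T-t)}}{2} \int_0^{\infty} \left( (T+1)^p + u^p \right) e^{-u}\,du \le C A_p^{(2)} e^{-(T-t)} (T+1)^p.$$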


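In order to evaluate $I_2$, notice that

$$K_T(s,t) - \frac{e^{t-s}}{2} = \frac{e^{t-s}}{2} \left[ (1 + h_1)(1 + h_2)(1 - h_3)^{-1} - 1 \right],$$

where $h_1 := e^{-2(T-s)}$, $h_2 := e^{-2t}$, $h_3 := e^{-2T}$. Notice that the assumption $T \ge 1$ yields $0 < h_3 < \frac12$. Therefore, we have

$$(1 - h_3)^{-1} = 1 + \frac{h_3}{1 - h_3} \le 1 + 2h_3.$$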
Furthermore, the inequalities $0 < h_1 \le 1$, $0 < h_2 \le 1$, $0 < h_3 < \frac12$ yield

$$(1 + h_1)(1 + h_2)(1 + 2h_3) \le 1 + 4h_1 + 2h_2 + 2h_3.$$

We infer that

$$0 \le K_T(s,t) - \frac{e^{t-s}}{2} \le \frac{e^{t-s}}{2} (4h_1 + 2h_2 + 2h_3) \le 2 e^{t-s} (h_1 + h_2 + h_3) = 2\left( e^{-2T+t+s} + e^{-t-s} + e^{t-s-2T} \right). \qquad (12)$$

It follows that

$$I_2 \le 2 (I_5 + I_6 + I_7),$$

where

$$I_5 := C e^{-2T+t} \int_0^T (s+1)^p\, e^{s}\,ds \le C e^{-(T-t)} (T+1)^p,$$
$$I_6 := C e^{-t} \int_0^T (s+1)^p\, e^{-s}\,ds \le C A_p^{(3)} e^{-t},$$
$$I_7 := C e^{t-2T} \int_0^T (s+1)^p\, e^{-s}\,ds \le C A_p^{(3)} e^{-(T-t)}.$$

By combining all estimates we obtain (10).


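The proof of (11) is completely similar. One should use the function

$$\widetilde K_T(s,t) := \frac{\cosh(T-s)\,(e^{t} - e^{-t})}{e^{T} - e^{-T}}$$

instead of $K_T(s,t)$. Then (12) is replaced by

$$-e^{-t-s} \le \widetilde K_T(s,t) - \frac{e^{t-s}}{2} \le e^{-2T+t+s} + e^{t-s-2T}, \qquad (13)$$

and all calculations go in the same way. □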


Once the proposition is proved, we have straightforward integral estimates. Let $\|\cdot\|_{2,T}$ denote the norm in the space $L_2[0,T]$.

Corollary 2.6 Under the assumptions of Proposition 2.4, we have

$$\|f_T - \widehat f\|_{2,T} \le 2 C A_p (T+1)^p$$

and

$$\|f_T' - \widehat f'\|_{2,T} \le 2 C A_p (T+1)^p. \qquad (14)$$
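Proof: We have

$$\|f_T - \widehat f\|_{2,T}^2 = \int_0^T |f_T(t) - \widehat f(t)|^2\,dt$$
$$\le C^2 A_p^2 (T+1)^{2p} \int_0^T \left( e^{-t} + e^{-T} + e^{-(T-t)} \right)^2 dt$$
$$\le 3 C^2 A_p^2 (T+1)^{2p} \int_0^T \left( e^{-2t} + e^{-2T} + e^{-2(T-t)} \right) dt$$
$$\le 3 C^2 A_p^2 (T+1)^{2p} \left( \tfrac12 + T e^{-2T} + \tfrac12 \right)$$
$$\le 4 C^2 A_p^2 (T+1)^{2p},$$

where at the last step we used the assumption $T \ge 1$. The proof of (14) is completely similar. □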


3 Application to processes with stationary increments


3.1 A brief reminder on the processes with stationary increments

A complex-valued random process $B(t)$, $t \in \mathbb{R}$, is called a process with stationary increments in the wide sense if it has finite second moments and the mean and the covariance of the process $B_{t_0}(t) := B(t_0 + t) - B(t_0)$ are the same for all $t_0 \in \mathbb{R}$. If such a process is stochastically continuous, then it admits a spectral representation of the form

$$B(t) = B_0 + D_0 t + \int_{\mathbb{R}\setminus\{0\}} (e^{itu} - 1)\,X(du), \qquad (15)$$

where $B_0, D_0$ are random variables with finite second moment and $X(du)$ is a complex-valued zero mean uncorrelated random measure on $\mathbb{R}\setminus\{0\}$, uncorrelated with $D_0$, see [?]. In the following we assume $B_0 = 0$ since the initial value is irrelevant to our purposes.

Recall that the covariance structure of $B$ is characterized by a deterministic spectral measure $\mu_B$ on $\mathbb{R}\setminus\{0\}$ defined by $\mu_B(A) := \mathbb{E}|X(A)|^2$. The spectral measure may be infinite but it must satisfy Lévy's integrability condition

$$\int_{\mathbb{R}\setminus\{0\}} \min(u^2, 1)\,\mu_B(du) < \infty.$$


In the sequel, $B(t)$, $t \in \mathbb{R}$, denotes a stochastically continuous, real-valued process with stationary increments in the wide sense such that $B(0) = 0$. We always work with a measurable version of $B$. Let us stress that even though $B$ is a real-valued process, the random measure $X$ may be complex, yet it must satisfy the condition $X(-A) = \overline{X(A)}$.

The standard deviation of $B$ grows at most linearly. We will use the following estimate:

$$\mathbb{E}|B(t)|^2 = \mathbb{E}|D_0|^2 t^2 + \int_{\mathbb{R}\setminus\{0\}} |e^{itu} - 1|^2\,\mu_B(du)$$
$$\le \mathbb{E}|D_0|^2 t^2 + t^2 \int_{[-1,1]\setminus\{0\}} u^2\,\mu_B(du) + 4\mu_B\{\mathbb{R}\setminus[-1,1]\}$$
$$\le A(t^2 + 1),$$

where

$$A := \max\left\{ \mathbb{E}|D_0|^2 + \int_{[-1,1]\setminus\{0\}} u^2\,\mu_B(du);\ 4\mu_B\{\mathbb{R}\setminus[-1,1]\} \right\}.$$

It follows that for any $T > 0$

$$\mathbb{E} \int_0^T |B(t)|^2\,dt < \infty,$$

therefore $B \in L_2[0,T]$ almost surely, and Proposition 2.1 applies to the sample paths of $B$ with $p = 1$.

If we additionally assume that the random measure $X$ is Gaussian, then the process $B$ is also Gaussian. Fractional Brownian motions $W^{(H)}(t)$, $0 < H \le 1$, are the best known Gaussian processes with stationary increments. The range of the parameter $0 < H < 1$ is associated with the family of spectral measures

$$\mu_H(du) := \frac{M_H\,du}{|u|^{2H+1}}, \qquad M_H := \frac{\Gamma(2H+1)\sin(\pi H)}{2\pi},$$

with $D_0 = 0$. We have a power-type variance

$$\mathbb{E}|W^{(H)}(t)|^2 = \int_{-\infty}^{\infty} |e^{itu} - 1|^2\, \frac{M_H\,du}{|u|^{2H+1}} = 2 \int_{-\infty}^{\infty} (1 - \cos(tu))\, \frac{M_H\,du}{|u|^{2H+1}}$$
$$= 4 M_H |t|^{2H} \int_0^{\infty} (1 - \cos v)\, \frac{dv}{v^{2H+1}} = 4 M_H |t|^{2H} (2H)^{-1} \int_0^{\infty} \sin v\, \frac{dv}{v^{2H}}$$
$$= 4 M_H |t|^{2H} (2H)^{-1}\, \frac{\pi}{2\Gamma(2H)\sin(\pi H)} = |t|^{2H}.$$

The case $H = 1$ is degenerate linear, i.e. $W^{(1)}(t) = D_0 t$ with $D_0$ being a standard Gaussian random variable. The remarkable properties of fractional Brownian motions are described e.g. in [?, Section 7.2] and in [21 Chapter 4].

The Wiener process $W$ is a special case of fractional Brownian motion corresponding to $H = \frac12$. It has the spectral measure $\mu_W(du) = \frac{du}{2\pi u^2}$.


3.2 Convergence of average least energy 

The following result describes the behavior of the average energy of the least energy approximation for an arbitrary process with (wide-sense) stationary increments. We call

$$\mathcal{E}_T(f, B) := \int_0^T \left[ (f(s) - B(s))^2 + f'(s)^2 \right] ds$$

the energy of the function $f$ with respect to $B$ on the interval $[0,T]$. If the function $B$ is fixed, we omit it from the notation.


Theorem 3.1 Let $B(t)$, $t \in \mathbb{R}$, be a stochastically continuous process with wide-sense stationary increments given by its spectral representation (15). Recall that $f_T$ denotes the minimizer of $\mathcal{E}_T(f, B)$ over all $f \in W_2^1[0,T]$. Then,

$$\lim_{T \to \infty} T^{-1}\, \mathbb{E}\,\mathcal{E}_T(f_T, B) = \mathcal{C}, \qquad (16)$$

where

$$\mathcal{C} := \mathbb{E}|D_0|^2 + \int_{\mathbb{R}\setminus\{0\}} \frac{u^2}{1 + u^2}\,\mu_B(du). \qquad (17)$$

The constant $\mathcal{C}$ is the smallest average amount of energy per unit of time needed for the approximation of $B$.


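Corollary 3.2 For the fractional Brownian motion $W^{(H)}$ we have

$$\mathcal{C} = \int_{-\infty}^{\infty} \frac{u^2}{1 + u^2}\, \frac{M_H\,du}{|u|^{2H+1}} = 2 M_H \int_0^{\infty} \frac{u^{1-2H}\,du}{1 + u^2} = M_H \int_0^{\infty} \frac{v^{-H}\,dv}{1 + v} = M_H \cdot \frac{\pi}{\sin(\pi H)} = \frac{\Gamma(2H+1)}{2}.$$

For the Wiener process ($H = \frac12$) this yields the constant

$$\mathcal{C} = \frac12, \qquad (18)$$

that can also be obtained by another method (see Section 4 below).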
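Remark 3.3 We can give an alternative nonspectral representation for the constant $\mathcal{C}$, namely,

$$\mathcal{C} = (\mathbb{E} D_0)^2 + \frac12 \int_0^{\infty} \mathrm{Var}(B(s))\, e^{-s}\,ds.$$

Indeed,

$$(\mathbb{E} D_0)^2 + \frac12 \int_0^{\infty} \mathrm{Var}(B(s))\, e^{-s}\,ds$$
$$= (\mathbb{E} D_0)^2 + \frac12\, \mathrm{Var}(D_0) \int_0^{\infty} s^2 e^{-s}\,ds + \frac12 \int_0^{\infty} \left( \int_{\mathbb{R}\setminus\{0\}} |1 - e^{isu}|^2\,\mu_B(du) \right) e^{-s}\,ds$$
$$= \mathbb{E} D_0^2 + \frac12 \int_{\mathbb{R}\setminus\{0\}} \left( \int_0^{\infty} (2 - e^{isu} - e^{-isu})\, e^{-s}\,ds \right) \mu_B(du)$$
$$= \mathbb{E} D_0^2 + \frac12 \int_{\mathbb{R}\setminus\{0\}} \left( 2 - \frac{1}{1 - iu} - \frac{1}{1 + iu} \right) \mu_B(du)$$
$$= \mathbb{E} D_0^2 + \int_{\mathbb{R}\setminus\{0\}} \frac{u^2}{1 + u^2}\,\mu_B(du) = \mathcal{C}.$$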
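The nonspectral representation of Remark 3.3 lends itself to a quick numerical cross-check against Corollary 3.2. The sketch below is an illustration only: the helper names `c_from_variance` and `c_fbm`, the truncation of the integral at `upper`, and the trapezoidal rule are assumptions of this example. It uses the variance $\mathrm{Var}\,W^{(H)}(s) = s^{2H}$ computed in Section 3.1 and zero drift.

```python
import math

def c_from_variance(variance, mean_drift=0.0, upper=60.0, h=1e-3):
    """Approximate (E D_0)^2 + 0.5 * int_0^inf Var(B(s)) e^{-s} ds
    by the trapezoidal rule on [0, upper] (hypothetical helper)."""
    n = int(round(upper / h))
    total = 0.5 * (variance(0.0) + variance(upper) * math.exp(-upper))
    for k in range(1, n):
        s = k * h
        total += variance(s) * math.exp(-s)
    return mean_drift ** 2 + 0.5 * h * total

def c_fbm(H):
    """Closed form of Corollary 3.2: Gamma(2H + 1) / 2."""
    return math.gamma(2.0 * H + 1.0) / 2.0

for H in (0.25, 0.5, 0.75):
    numeric = c_from_variance(lambda s, H=H: s ** (2.0 * H))
    print(H, numeric, c_fbm(H))
```

Both columns should agree up to the quadrature error, since $\frac12\int_0^\infty s^{2H} e^{-s}\,ds = \Gamma(2H+1)/2$.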

Remark 3.4 Should we introduce a viscosity constant $\kappa > 0$, i.e. change the penalty to $Q(y) = \kappa^2 y^2$, then by a linear time change the minimal energy approximation problem on $[0,T]$ for $B$ reduces to that for the process $\widetilde B(t) := B(t/\kappa)$ on the interval $[0, \kappa T]$ with $\kappa = 1$. Thus the limit energy constant from (17) becomes

$$\mathcal{C}_\kappa := \mathbb{E}|D_0|^2 + \int_{\mathbb{R}\setminus\{0\}} \frac{\kappa^2 u^2}{\kappa^2 + u^2}\,\mu_B(du)$$

in this case.


Remark 3.5 It is worthwhile to compare the least energy approximation constant for the Wiener process (18) with the analogous result of adaptive control, see [?, p. 319]. It turns out that the optimal adaptive control requires twice as much energy as the non-adaptive one.


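Proof of Theorem 3.1. Consider the approximations (7) and (8) related to $B$. They become

$$\widehat f(t) = D_0 t + \frac12 \int_{-\infty}^{\infty} \int_{\mathbb{R}\setminus\{0\}} e^{-|s-t|} (e^{isu} - 1)\,X(du)\,ds := D_0 t + \int_{\mathbb{R}\setminus\{0\}} K^{(0)}(t,u)\,X(du)$$

and

$$\widehat f'(t) = D_0 + \frac12 \int_{-\infty}^{\infty} \int_{\mathbb{R}\setminus\{0\}} \mathrm{sgn}(s-t)\, e^{-|s-t|} (e^{isu} - 1)\,X(du)\,ds := D_0 + \int_{\mathbb{R}\setminus\{0\}} K^{(1)}(t,u)\,X(du),$$

where we have the following expressions for the respective kernels:

$$K^{(0)}(t,u) = \frac12 \int_{-\infty}^{\infty} e^{-|s-t|} (e^{isu} - 1)\,ds = \frac{e^{itu}}{2} \int_{-\infty}^{\infty} e^{-|v|} e^{ivu}\,dv - 1$$
$$= \frac{e^{itu}}{2} \left[ \frac{1}{1 + iu} + \frac{1}{1 - iu} \right] - 1 = \frac{e^{itu}}{1 + u^2} - 1,$$

and

$$K^{(1)}(t,u) = \frac12 \int_{-\infty}^{\infty} \mathrm{sgn}(s-t)\, e^{-|s-t|} (e^{isu} - 1)\,ds = \frac{e^{itu}}{2} \int_{-\infty}^{\infty} \mathrm{sgn}(v)\, e^{-|v|} e^{ivu}\,dv$$
$$= \frac{e^{itu}}{2} \left[ -\frac{1}{1 + iu} + \frac{1}{1 - iu} \right] = \frac{iu\, e^{itu}}{1 + u^2}.$$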


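We conclude that

$$\widehat f(t) - B(t) = \int_{\mathbb{R}\setminus\{0\}} \left( \frac{1}{1 + u^2} - 1 \right) e^{itu}\,X(du) = \int_{\mathbb{R}\setminus\{0\}} \frac{-u^2}{1 + u^2}\, e^{itu}\,X(du)$$

and

$$\widehat f'(t) = D_0 + \int_{\mathbb{R}\setminus\{0\}} \frac{iu}{1 + u^2}\, e^{itu}\,X(du). \qquad (19)$$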


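We see that both the deviation $\widehat f - B$ and the derivative $\widehat f'$ are wide sense stationary processes. By the well known isometric property we have

$$\mathbb{E}|\widehat f(t) - B(t)|^2 = \int_{\mathbb{R}\setminus\{0\}} \frac{u^4}{(1 + u^2)^2}\,\mu_B(du), \qquad t \in \mathbb{R},$$

and

$$\mathbb{E}|\widehat f'(t)|^2 = \mathbb{E}|D_0|^2 + \int_{\mathbb{R}\setminus\{0\}} \frac{u^2}{(1 + u^2)^2}\,\mu_B(du), \qquad t \in \mathbb{R}.$$

Hence, for any $T > 0$ we have

$$\mathbb{E}\,\mathcal{E}_T(\widehat f, B) = \int_0^T \left[ \mathbb{E}|\widehat f(t) - B(t)|^2 + \mathbb{E}|\widehat f'(t)|^2 \right] dt = T \left[ \mathbb{E}|\widehat f(0) - B(0)|^2 + \mathbb{E}|\widehat f'(0)|^2 \right]$$
$$= T \left[ \mathbb{E}|D_0|^2 + \int_{\mathbb{R}\setminus\{0\}} \frac{u^4 + u^2}{(1 + u^2)^2}\,\mu_B(du) \right] = T \left[ \mathbb{E}|D_0|^2 + \int_{\mathbb{R}\setminus\{0\}} \frac{u^2}{1 + u^2}\,\mu_B(du) \right] = T\,\mathcal{C}.$$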


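It remains to show that the average energy of the least energy approximation $f_T$ and that of the stationary approximation $\widehat f$ are sufficiently close, i.e.

$$\lim_{T \to \infty} T^{-1} \left[ \mathbb{E}\,\mathcal{E}_T(f_T, B) - \mathbb{E}\,\mathcal{E}_T(\widehat f, B) \right] = 0. \qquad (20)$$

Notice that by the triangle inequality in $L_2[0,T]$ we have

$$\left| \sqrt{\mathcal{E}_T(f_T, B)} - \sqrt{\mathcal{E}_T(\widehat f, B)} \right| \le \sqrt{\mathcal{E}_T(f_T - \widehat f, 0)},$$

hence,

$$\left| \mathcal{E}_T(f_T, B) - \mathcal{E}_T(\widehat f, B) \right| \le \mathcal{E}_T(f_T - \widehat f, 0) + 2 \sqrt{\mathcal{E}_T(\widehat f, B)}\, \sqrt{\mathcal{E}_T(f_T - \widehat f, 0)}$$

and

$$\left| \mathbb{E}\,\mathcal{E}_T(f_T, B) - \mathbb{E}\,\mathcal{E}_T(\widehat f, B) \right| \le \mathbb{E} \left| \mathcal{E}_T(f_T, B) - \mathcal{E}_T(\widehat f, B) \right|$$
$$\le \mathbb{E}\,\mathcal{E}_T(f_T - \widehat f, 0) + 2\, \mathbb{E} \sqrt{\mathcal{E}_T(\widehat f, B)}\, \sqrt{\mathcal{E}_T(f_T - \widehat f, 0)}$$
$$\le \mathbb{E}\,\mathcal{E}_T(f_T - \widehat f, 0) + 2 \sqrt{\mathbb{E}\,\mathcal{E}_T(\widehat f, B)\; \mathbb{E}\,\mathcal{E}_T(f_T - \widehat f, 0)}$$
$$= \mathbb{E}\,\mathcal{E}_T(f_T - \widehat f, 0) + 2 \sqrt{\mathcal{C}\, T\; \mathbb{E}\,\mathcal{E}_T(f_T - \widehat f, 0)}. \qquad (21)$$

We see that

$$\lim_{T \to \infty} T^{-1} \left[ \mathbb{E}\,\mathcal{E}_T(f_T - \widehat f, 0) \right] = 0 \qquad (22)$$

would imply the desired relation (20).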
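We first analyze the potential energy and prove that

$$\sup_{T > 0}\ \sup_{0 \le t \le T} \mathbb{E}|f_T(t) - B(t)|^2 < \infty, \qquad (23)$$
$$\sup_{t \ge 0} \mathbb{E}|\widehat f(t) - B(t)|^2 < \infty. \qquad (24)$$

The part concerning $\widehat f$ is trivial because $\widehat f - B$ is stationary. Furthermore, let us represent

$$f_T(t) - B(t) = \int_0^T R_T(s,t)\,(B(s) - B(t))\,ds,$$

where for $s \in [t,T]$ we have by (12)

$$0 \le R_T(s,t) := K_T(s,t) \le \frac{e^{t-s}}{2} + 2\left( e^{(t-s)-2(T-s)} + e^{(t-s)-2t} + e^{(t-s)-2T} \right) \le 7 e^{t-s} = 7 e^{-|t-s|},$$

while for $s \in [0,t]$ we have

$$0 \le R_T(s,t) := K_T(s,t) - \sinh(t-s) = \left( K_T(s,t) - \frac{e^{t-s}}{2} \right) + \frac{e^{s-t}}{2}$$
$$\le 2\left( e^{-2T+t+s} + e^{-t-s} + e^{t-s-2T} \right) + \frac{e^{s-t}}{2} = 2\left( e^{(s-t)-2(T-t)} + e^{(s-t)-2s} + e^{(s-t)-2(T-(t-s))} \right) + \frac{e^{s-t}}{2}$$
$$\le 7 e^{s-t} = 7 e^{-|t-s|}.$$

Hence,

$$\mathbb{E}|f_T(t) - B(t)|^2 \le \mathbb{E} \left( 7 \int_0^T e^{-|t-s|}\, |B(s) - B(t)|\,ds \right)^2 \le c\, \mathbb{E} \int_0^T e^{-|t-s|}\, |B(s) - B(t)|^2\,ds \le c A \int_0^T e^{-|t-s|} \left[ (s-t)^2 + 1 \right] ds,$$

and (23) follows. By using also (24) we obtain

$$\sup_{T > 0}\ \sup_{0 \le t \le T} \mathbb{E}|f_T(t) - \widehat f(t)|^2 < \infty. \qquad (25)$$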
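This estimate is still too crude, and we continue as follows:

$$f_T(t) - \widehat f(t) = -\frac12 \int_{-\infty}^{0} e^{s-t} B(s)\,ds + \int_0^T \left( K_T(s,t) - e^{t-s}/2 \right) B(s)\,ds - \frac12 \int_T^{\infty} e^{t-s} B(s)\,ds := G_1(t) + G_2(t) + G_3(t).$$

For $G_1$ we have

$$\mathbb{E}|G_1(t)|^2 \le e^{-2t}\, \mathbb{E} \left( \int_{-\infty}^{0} e^{s} |B(s)|\,ds \right)^2 \le e^{-2t}\, \mathbb{E} \int_{-\infty}^{0} e^{s} |B(s)|^2\,ds \le A e^{-2t} \int_{-\infty}^{0} e^{s} (|s|^2 + 1)\,ds := c A e^{-2t}.$$

For $G_3$ we have

$$\mathbb{E}|G_3(t)|^2 \le e^{-2(T-t)}\, \mathbb{E} \left( \int_T^{\infty} e^{T-s} |B(s)|\,ds \right)^2 \le e^{-2(T-t)}\, \mathbb{E} \int_T^{\infty} e^{T-s} |B(s)|^2\,ds$$
$$\le A e^{-2(T-t)} \int_T^{\infty} e^{T-s} (s^2 + 1)\,ds \le A e^{-2(T-t)} \int_T^{\infty} e^{T-s} \left( 2(s-T)^2 + 2T^2 + 1 \right) ds \le c A e^{-2(T-t)} (T^2 + 1).$$

This estimate is not good enough when $t$ is close to $T$. This is why we need (25). For $G_2$ we have by (12)

$$|G_2(t)| \le 2 \left( G_{2,1}(t) + G_{2,2}(t) + G_{2,3}(t) \right),$$

where

$$G_{2,1}(t) = \int_0^T e^{t-2T+s}\, |B(s)|\,ds,$$
$$G_{2,2}(t) = \int_0^T e^{-t-s}\, |B(s)|\,ds,$$
$$G_{2,3}(t) = \int_0^T e^{t-s-2T}\, |B(s)|\,ds.$$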
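Moreover,

$$\mathbb{E}|G_{2,1}(t)|^2 = e^{2(t-2T)}\, \mathbb{E} \left( \int_0^T e^{s} |B(s)|\,ds \right)^2 \le e^{2t-3T}\, \mathbb{E} \int_0^T e^{s} |B(s)|^2\,ds \le A e^{2t-3T} \int_0^T e^{s} (s^2 + 1)\,ds \le 2A e^{-2(T-t)} (T^2 + 1),$$

$$\mathbb{E}|G_{2,2}(t)|^2 = e^{-2t}\, \mathbb{E} \left( \int_0^T e^{-s} |B(s)|\,ds \right)^2 \le e^{-2t}\, \mathbb{E} \int_0^T e^{-s} |B(s)|^2\,ds \le A e^{-2t} \int_0^{\infty} e^{-s} (s^2 + 1)\,ds \le 3A e^{-2t},$$

$$\mathbb{E}|G_{2,3}(t)|^2 = e^{2t-4T}\, \mathbb{E} \left( \int_0^T e^{-s} |B(s)|\,ds \right)^2 \le e^{2t-4T}\, \mathbb{E} \int_0^T e^{-s} |B(s)|^2\,ds \le A e^{2t-4T} \int_0^{\infty} e^{-s} (s^2 + 1)\,ds \le 3A e^{2t-4T}.$$

We summarize the latter calculations as

$$\mathbb{E}|f_T(t) - \widehat f(t)|^2 \le c A \left( e^{-2t} + e^{-2(T-t)} (T^2 + 1) \right). \qquad (26)$$

Now we proceed with integration over $[0,T]$. By applying (26) on the interval $[0, T - 3\ln T]$ and (25) on the interval $[T - 3\ln T, T]$ we obtain

$$\mathbb{E} \int_0^T |f_T(t) - \widehat f(t)|^2\,dt \le c A (1 + \ln T) = o(T). \qquad (27)$$

Kinetic energy is studied in the same fashion. By using (13) instead of (12), we replace (23), (24) with

$$\sup_{T > 0}\ \sup_{0 \le t \le T} \mathbb{E}|f_T'(t)|^2 < \infty, \qquad \sup_{t \ge 0} \mathbb{E}|\widehat f'(t)|^2 < \infty.$$

Hence (25) may be replaced by

$$\sup_{T > 0}\ \sup_{0 \le t \le T} \mathbb{E}|f_T'(t) - \widehat f'(t)|^2 < \infty.$$

Further, using again (13) we replace (26) with

$$\mathbb{E}|f_T'(t) - \widehat f'(t)|^2 \le c A \left( e^{-2t} + e^{-2(T-t)} (T^2 + 1) \right).$$

Integration yields

$$\mathbb{E} \int_0^T |f_T'(t) - \widehat f'(t)|^2\,dt \le c A (1 + \ln T). \qquad (28)$$

By merging (27) and (28) we get

$$\mathbb{E}\,\mathcal{E}_T(f_T - \widehat f, 0) \le c A (1 + \ln T), \qquad (29)$$

which is a quantitative version of (22). □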


3.3 Almost sure and $L_1$ convergence

The random variable

$$Z := |D_0|^2 - \mathbb{E}|D_0|^2 + \mathcal{C} = |D_0|^2 + \int_{\mathbb{R}\setminus\{0\}} \frac{u^2}{1 + u^2}\,\mu_B(du)$$

is the right candidate for the a.s. least energy limit. Notice that if $B$ has no systematic drift (i.e. $D_0 = 0$), then $Z = \mathcal{C}$ is a deterministic constant.

We first develop a reduction tool showing that almost sure and $L_1$ convergence of the least approximation energy are reduced to the stationary case.


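Proposition 3.6 If $T^{-1}\mathcal{E}_T(\widehat f, B) \to Z$ almost surely, then $T^{-1}\mathcal{E}_T(f_T, B) \to Z$ almost surely.

If $T^{-1}\mathcal{E}_T(\widehat f, B) \to Z$ in $L_1$, then $T^{-1}\mathcal{E}_T(f_T, B) \to Z$ in $L_1$.

Proof: We know from (21) and (29) that for any $T$

$$\mathbb{E} \left| \mathcal{E}_T(f_T, B) - \mathcal{E}_T(\widehat f, B) \right| \le c A \sqrt{T (1 + \ln T)}. \qquad (30)$$

Let us fix any $a > 1$ and consider the geometric sequence of times $T_n := a^n$. It follows from (30) that

$$\sum_{n=1}^{\infty} T_n^{-1}\, \mathbb{E} \left| \mathcal{E}_{T_n}(f_{T_n}, B) - \mathcal{E}_{T_n}(\widehat f, B) \right| < \infty.$$

Therefore

$$T_n^{-1} \left| \mathcal{E}_{T_n}(f_{T_n}, B) - \mathcal{E}_{T_n}(\widehat f, B) \right| \to 0$$

almost surely and in $L_1$. Under the assumptions of our proposition we obtain

$$T_n^{-1}\, \mathcal{E}_{T_n}(f_{T_n}, B) \to Z$$

almost surely (resp. in $L_1$).

Furthermore, since the least approximation energy $\mathcal{E}_T(f_T, B)$ is an increasing function of $T$, we have for any $T \in [T_n, T_{n+1}]$

$$\left| T^{-1} \mathcal{E}_T(f_T, B) - Z \right| \le \max\left\{ \left| T^{-1} \mathcal{E}_{T_n}(f_{T_n}, B) - Z \right|,\ \left| T^{-1} \mathcal{E}_{T_{n+1}}(f_{T_{n+1}}, B) - Z \right| \right\}$$
$$\le a \max\left\{ \left| T_n^{-1} \mathcal{E}_{T_n}(f_{T_n}, B) - Z \right|,\ \left| T_{n+1}^{-1} \mathcal{E}_{T_{n+1}}(f_{T_{n+1}}, B) - Z \right| \right\} + (a - 1) Z.$$

Now the a.s. convergence (resp. $L_1$-convergence) of $T^{-1}\mathcal{E}_T(f_T, B)$ to $Z$ follows from that of $T_n^{-1}\mathcal{E}_{T_n}(f_{T_n}, B)$ by letting $a \searrow 1$.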


Proposition 3.7 If the process $B$ is Gaussian and its spectral measure $\mu_B$ has no atoms, then $T^{-1}\mathcal{E}_T(f_T, B) \to Z$ in $L_1$ and almost surely, as $T \to \infty$.

Proof: According to Proposition 3.6, it is sufficient to prove that

$$T^{-1}\mathcal{E}_T(\widehat f, B) \to Z, \qquad \text{as } T \to \infty,$$

in $L_1$ and almost surely. We may split the energy into potential and kinetic parts and check that

$$T^{-1} \int_0^T |\widehat f(t) - B(t)|^2\,dt \to \int_{\mathbb{R}\setminus\{0\}} \frac{u^4}{(1 + u^2)^2}\,\mu_B(du)$$

and

$$T^{-1} \int_0^T |\widehat f'(t)|^2\,dt \to |D_0|^2 + \int_{\mathbb{R}\setminus\{0\}} \frac{u^2}{(1 + u^2)^2}\,\mu_B(du).$$

Recall that by (19) we have a representation

$$\widehat f'(t) = D_0 + Y(t),$$

where $Y$ is a centered stationary Gaussian process with the spectral measure

$$\widetilde\mu(du) = \frac{u^2}{(1 + u^2)^2}\,\mu_B(du).$$

It follows that

$$T^{-1} \int_0^T |\widehat f'(t)|^2\,dt = |D_0|^2 + 2 T^{-1} D_0 \int_0^T Y(t)\,dt + T^{-1} \int_0^T |Y(t)|^2\,dt.$$

Recall that Gaussian stationary processes whose spectral measure has no atoms are ergodic, see [?, ?]. Since both $B(t) - \widehat f(t)$ and $Y(t)$ belong to this class, we may use the Birkhoff ergodic theorem and obtain convergence of time-averages to the corresponding expectations in any $L_p$, $p \in [1, \infty)$, and almost surely, i.e.

$$T^{-1} \int_0^T |\widehat f(t) - B(t)|^2\,dt \to \int_{\mathbb{R}\setminus\{0\}} \frac{u^4}{(1 + u^2)^2}\,\mu_B(du),$$
$$T^{-1} \int_0^T Y(t)\,dt \to 0,$$
$$T^{-1} \int_0^T |Y(t)|^2\,dt \to \int_{\mathbb{R}\setminus\{0\}} \frac{u^2}{(1 + u^2)^2}\,\mu_B(du),$$

and we are done. □

Another interesting class of examples where we can provide an affirmative answer for the least energy convergence is given by Lévy processes (i.e. processes with independent stationary increments).

Proposition 3.8 Let $B(t)$, $t \ge 0$, be a Lévy process having finite second moments. Then $T^{-1}\mathcal{E}_T(f_T, B) \to Z$ in $L_1$ and $T^{-1}\mathcal{E}_T(f_T, B) \to Z$ almost surely, as $T \to \infty$, where

$$Z = \mathcal{C} = |\mathbb{E} B(1)|^2 + \tfrac12 \mathrm{Var}\,B(1).$$

Proof: Without loss of generality, we may extend $B$ to the negative half-axis in such a way that $-B(-t)$, $t \le 0$, is an independent equidistributed copy of $B(t)$, $t \ge 0$. Looking at $B$ as a process with stationary increments in the wide sense, we see that its linear part $D_0 t$ is deterministic, with $\mathbb{E} B(t) = D_0 t$, while the spectral measure $\mu_B$ is the same as that of the Wiener process up to the numerical factor $\mathrm{Var}\,B(1)$. It follows that the hypothetical energy limit $Z$ indeed has the form given in the proposition.

Given the two-sided process $B(t)$, $t \in \mathbb{R}$, we may define an associated Lévy noise (an independently scattered homogeneous random measure) $X(du)$ on $\mathbb{R}$ by

$$X((t_1, t_2]) := B(t_2) - B(t_1), \qquad t_1 < t_2.$$

Elementary calculations show that

$$\widehat f(t) - B(t) = \frac12 \int_{-\infty}^{\infty} e^{-|\tau|} B(t + \tau)\,d\tau - B(t) = \frac12 \int_{-\infty}^{\infty} e^{-|t-u|}\, \mathrm{sgn}(u - t)\,X(du)$$

and

$$\widehat f'(t) = \frac12 \int_{-\infty}^{\infty} e^{-|\tau|}\, \mathrm{sgn}(\tau)\, B(t + \tau)\,d\tau = \frac12 \int_{-\infty}^{\infty} e^{-|t-u|}\,X(du).$$

Both processes belong to the class of Ornstein-Uhlenbeck processes driven by Lévy noise. As such, they are ergodic, cf. [?, Theorem 5], [9], and their squares satisfy the almost sure and $L_1$ ergodic theorems. By applying Proposition 3.6 we complete the proof. □


Remark 3.9 There exist strong invariance principles providing a certain closeness between the sample paths of a Wiener process and a Lévy process. However, they do not seem to be good enough for the transfer of least energy approximation estimates.

We conclude this series of examples by mentioning an interesting open problem. Let $B(t)$, $t \ge 0$, be an $\alpha$-stable Lévy process with $0 < \alpha < 2$. Then the sample paths of $B$ are locally bounded, thus the least energy approximation problem is perfectly meaningful. However, since $B(t)$ has infinite second moment for any $t > 0$, the results of the present paper do not apply. We conjecture that in this case the least approximation energy would not grow in a quasi-deterministic linear way, but would rather imitate a stable subordinator of order $\alpha/2$ with the corresponding scaling, because every jump of $B$ of large size $r$ would be reflected by a fast accumulation of energy of order $r^2$ for the approximating function during a finite time interval. Handling such different behavior is beyond the scope of the present contribution.


4 Wiener process: alternative approach 

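In view of the importance of the Wiener process $W := W^{(1/2)}$, we find it reasonable to trace an alternative approach to its least energy approximation. We prove that

$$\mathbb{E}\,\mathcal{E}_T(f_T, W) \sim \frac{T}{2}, \qquad \text{as } T \to \infty, \qquad (31)$$

as obtained before in (18). By the scaling property of the Wiener process, the optimization problem

$$\mathcal{E}_T(f, W) = \int_0^T \left( |f(t) - W(t)|^2 + f'(t)^2 \right) dt \searrow \min, \qquad f \in W_2^1[0,T],$$

is equivalent to the problem

$$\widetilde{\mathcal{E}}_T(f, W) := \int_0^1 \left( T^2 |f(t) - W(t)|^2 + f'(t)^2 \right) dt \searrow \min, \qquad f \in W_2^1[0,1]. \qquad (32)$$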
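Let $(e_j)_{j \ge 1}$ be the eigenbase of the covariance operator of $W$ in $L_2[0,1]$ and let $(\gamma_j)_{j \ge 1}$ be the corresponding eigenvalues. It is well known that

$$\gamma_j = \frac{1}{\pi^2 (j - 1/2)^2}, \qquad e_j(t) = \sqrt{2}\, \sin\left( (j - \tfrac12) \pi t \right), \qquad j = 1, 2, \ldots$$

Denoting by $w_1, w_2, \ldots$ independent Gaussian random variables with $\mathbb{E}\, w_j = 0$ and $\mathrm{Var}\, w_j = \gamma_j$, we have the Karhunen-Loève expansion of the Wiener process

$$W(t) = \sum_{j=1}^{\infty} w_j e_j(t), \qquad t \in [0,1].$$

For any absolutely continuous function $f$ having square integrable derivative we may write the expansions

$$f - f(0) := \sum_{j=1}^{\infty} f_j e_j$$

and

$$f' = \sum_{j=1}^{\infty} f_j e_j' = \sum_{j=1}^{\infty} \gamma_j^{-1/2} f_j\, \gamma_j^{1/2} e_j'.$$

Since the system of functions $(\gamma_j^{1/2} e_j')$ is orthonormal, we have

$$\int_0^1 f'(t)^2\,dt = \sum_{j=1}^{\infty} \gamma_j^{-1} f_j^2.$$

It follows that the problem (32) takes a coordinate form

$$T^2 \int_0^1 \Big| f(0) + \sum_{j=1}^{\infty} (f_j - w_j) e_j(t) \Big|^2 dt + \sum_{j=1}^{\infty} \gamma_j^{-1} f_j^2$$
$$= T^2 \Big[ f(0)^2 + 2 f(0) S + \sum_{j=1}^{\infty} |f_j - w_j|^2 \Big] + \sum_{j=1}^{\infty} \gamma_j^{-1} f_j^2 \searrow \min, \qquad (33)$$

where

$$S := \sum_{j=1}^{\infty} \sqrt{2\gamma_j}\, (f_j - w_j). \qquad (34)$$

We first optimize over the initial value $f(0)$ with the other coordinates $f_j$ fixed and find $f(0) = -S$. Now, the optimization problem becomes

$$T^2 \Big[ -S^2 + \sum_{j=1}^{\infty} |f_j - w_j|^2 \Big] + \sum_{j=1}^{\infty} \gamma_j^{-1} f_j^2 \searrow \min. \qquad (35)$$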
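Since the partial derivatives in each $f_j$ must vanish, we find

$$f_j = \frac{T^2 \left( \sqrt{2\gamma_j}\, S + w_j \right)}{T^2 + \frac{1}{\gamma_j}}.$$

It follows that

$$\left( T^2 + \tfrac{1}{\gamma_j} \right) f_j^2 = \frac{T^4 \left( 2 S^2 \gamma_j + 2 \sqrt{2\gamma_j}\, S w_j + w_j^2 \right)}{T^2 + \frac{1}{\gamma_j}}, \qquad 2 T^2 f_j w_j = \frac{2 T^4 \sqrt{2\gamma_j}\, S w_j + 2 T^4 w_j^2}{T^2 + \frac{1}{\gamma_j}}. \qquad (36)$$

We plug these expressions into (35), notice that the terms containing $S w_j$ cancel, and obtain the least approximation energy

$$\mathcal{E}_T(f_T, W) = -T^2 S^2 + \sum_{j=1}^{\infty} \frac{2 T^4 S^2 \gamma_j}{T^2 + \frac{1}{\gamma_j}} + \sum_{j=1}^{\infty} \left( T^2 - \frac{T^4}{T^2 + \frac{1}{\gamma_j}} \right) w_j^2.$$

By using the identity

$$\sum_{j=1}^{\infty} (2\gamma_j) = 1 \qquad (37)$$

we may rewrite the least energy as

$$\mathcal{E}_T(f_T, W) = -2 S^2 \sum_{j=1}^{\infty} \frac{T^2}{T^2 + \frac{1}{\gamma_j}} + \sum_{j=1}^{\infty} \frac{T^2 w_j^2 / \gamma_j}{T^2 + \frac{1}{\gamma_j}}.$$

Recall that $\mathbb{E}\, w_j^2 = \gamma_j$. We find the average least energy

$$\mathbb{E}\,\mathcal{E}_T(f_T, W) = \left( -2\, \mathbb{E}[S^2] + 1 \right) \sum_{j=1}^{\infty} \frac{T^2}{T^2 + \frac{1}{\gamma_j}}.$$

Since

$$\sum_{j=1}^{\infty} \frac{1}{1 + \frac{1}{T^2 \gamma_j}} \sim \frac{T}{\pi} \int_0^{\infty} \frac{dx}{1 + x^2} = \frac{T}{2}, \qquad \text{as } T \to \infty, \qquad (38)$$

and, as we will see,

$$\mathbb{E}[S^2] \sim \frac{1}{2T}, \qquad \text{as } T \to \infty, \qquad (39)$$

we arrive at (31).

It remains to analyze the behavior of $S$. By using the definition of $S$ in (34) and formulae (36) we get an equation

$$S = \sum_{j=1}^{\infty} \sqrt{2\gamma_j} \left( \frac{T^2 \left( \sqrt{2\gamma_j}\, S + w_j \right)}{T^2 + \frac{1}{\gamma_j}} - w_j \right).$$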
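Solving it for $S$ we get

$$S = \frac{A(T)}{D(T)}$$

with

$$A(T) = \sum_{j=1}^{\infty} \frac{w_j \sqrt{2\gamma_j}}{T^2 \gamma_j + 1}, \qquad D(T) = \sum_{j=1}^{\infty} \frac{2 T^2 \gamma_j}{T^2 + \frac{1}{\gamma_j}} - 1.$$

Notice that $A(T)$ is a centered normal random variable with variance

$$\mathbb{E}|A(T)|^2 = \sum_{j=1}^{\infty} \frac{2 \gamma_j^2}{(T^2 \gamma_j + 1)^2} \sim \frac{2}{\pi} \int_0^{\infty} \frac{dx}{(1 + x^2)^2}\; T^{-3} = \frac{1}{2T^3},$$

and for the non-random denominator $D(T)$, by using (37) and (38), we have

$$D(T) = -\frac{2}{T^2} \sum_{j=1}^{\infty} \frac{1}{1 + \frac{1}{T^2 \gamma_j}} \sim -\frac{1}{T}.$$

Now (39) is confirmed and we are done.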
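The series appearing in this section are easy to evaluate numerically. The sketch below is an illustration only: the function name `wiener_energy_terms` and the truncation level `n_terms` are assumptions of this example, and the truncated tails are of order $T^2/(\pi^2 n)$. It uses $1/\gamma_j = \pi^2 (j - 1/2)^2$ and computes the sum in (38), the ratio $\mathbb{E}|A(T)|^2 / D(T)^2$ from (39), and the resulting average energy.

```python
import math

def wiener_energy_terms(T, n_terms=200000):
    """Truncated series from Section 4 with gamma_j = 1 / (pi^2 (j - 1/2)^2)."""
    main_sum = 0.0   # sum_j T^2 / (T^2 + 1/gamma_j), cf. (38)
    var_a = 0.0      # E|A(T)|^2 = sum_j 2 gamma_j^2 / (T^2 gamma_j + 1)^2
    d = 0.0          # D(T) = -sum_j 2 gamma_j / (T^2 gamma_j + 1)
    for j in range(1, n_terms + 1):
        inv_gamma = (math.pi * (j - 0.5)) ** 2
        denom = T * T + inv_gamma
        main_sum += T * T / denom
        var_a += 2.0 / denom ** 2
        d -= 2.0 / denom
    mean_s2 = var_a / d ** 2                        # E S^2 = E|A|^2 / D^2, cf. (39)
    mean_energy = (1.0 - 2.0 * mean_s2) * main_sum  # E E_T(f_T, W)
    return main_sum, mean_s2, mean_energy

T = 20.0
main_sum, mean_s2, mean_energy = wiener_energy_terms(T)
print("sum / (T/2):", main_sum / (T / 2.0))
print("2T * E[S^2]:", 2.0 * T * mean_s2)
print("E energy / T:", mean_energy / T)
```

The first two printed ratios should be close to 1, while the last one approaches $1/2$ only at the rate $1/T$ because of the correction factor $1 - 2\,\mathbb{E}[S^2]$.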


5 Addendum: Proof of Proposition |2.1 

Proof of Proposition |2.1| We first notice that 


inf <?r(/) <‘f^T(O) = / Q{—B{t))dt<oo. 

/6Wl[0,T] Jo 

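Indeed, if $B(\cdot)$ is bounded, then $Q(-B(\cdot))$ is bounded. If the growth condition on $Q$ holds, then we have

$$Q(x) \le \tilde{A}\left(|x|^{p+1} + 1\right), \qquad x \in \mathbb{R},$$

with $\tilde{A} := 2A + Q(0)$. Hence, if $B \in L_{p+1}[0,T]$, then

$$\int_0^T Q(-B(t))\,dt \le \tilde{A}\int_0^T \left(|B(t)|^{p+1} + 1\right) dt < \infty.$$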

Second, we show that we may restrict the minimization in the variational problem to a subclass

$$\mathbb{W}_{C,M} := \left\{ f \in W_2^1[0,T] : \int_0^T f'(t)^2\,dt \le C,\ \max_{0 \le t \le T} |f(t)| \le M \right\} \tag{40}$$
with sufficiently large parameters $C$ and $M$. To justify this statement, it is sufficient to show that for each $r > 0$ there exist $C, M$ such that

$$\{f : \mathcal{E}_T(f) \le r\} \subset \mathbb{W}_{C,M}.$$

Let $f$ be such that $\mathcal{E}_T(f) \le r$. Then

$$\int_0^T f'(t)^2\,dt \le \mathcal{E}_T(f) \le r$$

and we may let $C := r$. Moreover, for any $0 \le s, t \le T$ we have $|f(s) - f(t)| \le \sqrt{rT}$ by the Hölder inequality. Therefore, if $|f(s)| > M$ holds for some $s \in [0,T]$, then $|f(t)| > M - \sqrt{rT}$ for all $t \in [0,T]$. We see that

$$r \ge \mathcal{E}_T(f) \ge \int_0^T Q(f(t) - B(t))\,dt \ge \mathrm{Leb}\left\{t : |B(t)| \le (M - \sqrt{rT})/2\right\} \cdot \inf\left\{Q(x) : |x| \ge (M - \sqrt{rT})/2\right\}.$$

When $M$ goes to infinity, the first factor tends to $T$, while the second tends to infinity due to the assumption $\lim_{x \to \pm\infty} Q(x) = +\infty$. Therefore for large $M$ we obtain a contradiction. Hence, for such $M$ the assumption $|f(s)| > M$ cannot hold, and (40) is confirmed.

Next, we show that the minimum of the problem is attained. Since the functional $\mathcal{E}_T(\cdot)$ is lower semi-continuous with respect to uniform convergence (notice that the potential part of the energy is even continuous), and since $\mathbb{W}_{C,M}$ is relatively compact with respect to the topology of uniform convergence, the minimum of $\mathcal{E}_T(\cdot)$ on $\mathbb{W}_{C,M}$ is indeed attained on some set of minimizers.

Next, since $Q(\cdot)$ is strictly convex, the functional $\mathcal{E}_T(\cdot)$ is also strictly convex, hence the minimizer is unique. Let us denote it by $f_T$.

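By the Lebesgue theorem, the Gâteaux (directional) derivative of $\mathcal{E}_T(\cdot)$,

$$\delta\mathcal{E}_T(f_T, \varepsilon) := \lim_{h \to 0} h^{-1}\left(\mathcal{E}_T(f_T + h\varepsilon) - \mathcal{E}_T(f_T)\right) = \int_0^T \left[2f_T'(t)\varepsilon'(t) + Q'(f_T(t) - B(t))\varepsilon(t)\right] dt,$$

is well defined and must vanish at $f_T$ for any $\varepsilon \in W_2^1[0,T]$. We have thus

$$\int_0^T \left[2f_T'(t)\varepsilon'(t) + Q'(f_T(t) - B(t))\varepsilon(t)\right] dt = 0. \tag{41}$$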


To justify the application of the Lebesgue dominated convergence theorem to the integrals

$$\int_0^T h^{-1}\left[Q(f_T(t) + h\varepsilon(t) - B(t)) - Q(f_T(t) - B(t))\right] dt,$$

notice that both functions $f_T$ and $\varepsilon$ are bounded, say, $\max|f_T(t)| \le M_1$, $\max|\varepsilon(t)| \le M_2$. Therefore, if $B(\cdot)$ is bounded, say, $\max|B(t)| \le M_3$, then for $|h| \le 1$ the integrand is uniformly bounded by the constant $M_2 \max_{|x| \le M_1 + M_2 + M_3} |Q'(x)|$, and we use the fact that the derivative of a convex function (if it exists everywhere) is locally bounded. Alternatively, if the growth condition on $Q$ holds, we have

$$h^{-1}\left|Q(f_T(t) + h\varepsilon(t) - B(t)) - Q(f_T(t) - B(t))\right| = \left|Q'(f_T(t) + \theta h\varepsilon(t) - B(t))\right| |\varepsilon(t)| \le A\left((M_1 + M_2 + |B(t)|)^p + 1\right) M_2,$$

which also provides an integrable majorant due to $B \in L_p[0,T]$.


Let us fix some $\tau \in (0,T]$ and apply (41) to the functions

$$\varepsilon_u(t) := \begin{cases} 1, & 0 \le t \le \tau - u; \\ 0, & t \ge \tau; \\ u^{-1}(\tau - t), & \tau - u < t < \tau, \end{cases}$$

with $0 < u < \tau$. We obtain

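$$\begin{aligned}
0 &= \int_0^T \left[2f_T'(t)\varepsilon_u'(t) + Q'(f_T(t) - B(t))\varepsilon_u(t)\right] dt \\
&= -2u^{-1}\int_{\tau-u}^{\tau} f_T'(t)\,dt + \int_0^{\tau-u} Q'(f_T(t) - B(t))\,dt + \int_{\tau-u}^{\tau} Q'(f_T(t) - B(t))\varepsilon_u(t)\,dt \\
&= -2u^{-1}\left(f_T(\tau) - f_T(\tau - u)\right) + \int_0^{\tau-u} Q'(f_T(t) - B(t))\,dt + \int_{\tau-u}^{\tau} Q'(f_T(t) - B(t))\varepsilon_u(t)\,dt.
\end{aligned}$$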

Now we let $u \searrow 0$ and see that the left derivative $(f_T)'_-(\tau)$ exists and

$$2(f_T)'_-(\tau) = \int_0^{\tau} Q'(f_T(t) - B(t))\,dt. \tag{42}$$

In exactly the same way one obtains

$$2(f_T)'_+(\tau) = -\int_{\tau}^{T} Q'(f_T(t) - B(t))\,dt, \qquad \tau \in [0,T). \tag{43}$$

Since $f_T'(\cdot)$ exists almost everywhere, for some $\tau \in (0,T)$ we have $(f_T)'_-(\tau) = (f_T)'_+(\tau)$, i.e.

$$\int_0^T Q'(f_T(t) - B(t))\,dt = 0. \tag{44}$$

This fact in turn proves that $(f_T)'_-(\tau) = (f_T)'_+(\tau)$ for every $\tau \in (0,T)$, i.e. the function $f_T(\cdot)$ is differentiable everywhere and we have

$$2f_T'(\tau) = -\int_{\tau}^{T} Q'(f_T(t) - B(t))\,dt, \qquad \tau \in (0,T).$$

By (44), the boundary conditions $(f_T)'_+(0) = (f_T)'_-(T) = 0$ also follow from representations (42) and (43). □
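For the quadratic penalty $Q(x) = x^2$ the conclusions of the proof have simple discrete analogues that can be checked numerically. The sketch below is one possible discretization, not a method taken from the paper: it minimizes the grid energy $\sum_i (f_{i+1}-f_i)^2/h + h\sum_i (f_i - B_i)^2$, whose stationarity conditions form a tridiagonal linear system with free boundary values. Summing these conditions gives $\sum_i (f_i - B_i) = 0$, the discrete counterpart of (44). The function names, the grid and the test target $B$ are hypothetical.

```python
import math


def least_energy_quadratic(B, h):
    """Minimize sum (f[i+1]-f[i])^2 / h + h * sum (f[i]-B[i])^2 over f, for Q(x) = x^2."""
    n = len(B)
    # Stationarity: -f[i-1] + (2 + h^2) f[i] - f[i+1] = h^2 B[i] in the interior,
    # and (1 + h^2) f[0] - f[1] = h^2 B[0] at the free ends (similarly at the right end).
    diag = [2.0 + h * h] * n
    diag[0] = diag[-1] = 1.0 + h * h
    rhs = [h * h * b for b in B]
    # Thomas algorithm for a tridiagonal system whose off-diagonal entries are -1.
    cp = [0.0] * n
    dp = [0.0] * n
    cp[0] = -1.0 / diag[0]
    dp[0] = rhs[0] / diag[0]
    for i in range(1, n):
        denom = diag[i] + cp[i - 1]
        cp[i] = -1.0 / denom
        dp[i] = (rhs[i] + dp[i - 1]) / denom
    f = [0.0] * n
    f[-1] = dp[-1]
    for i in range(n - 2, -1, -1):
        f[i] = dp[i] - cp[i] * f[i + 1]
    return f


def discrete_energy(f, B, h):
    """Grid version of the energy: kinetic part plus quadratic penalty."""
    kinetic = sum((f[i + 1] - f[i]) ** 2 for i in range(len(f) - 1)) / h
    potential = h * sum((fi - bi) ** 2 for fi, bi in zip(f, B))
    return kinetic + potential


if __name__ == "__main__":
    h = 0.01
    B = [math.sin(3 * i * h) + 0.5 * i * h for i in range(501)]  # illustrative target
    f = least_energy_quadratic(B, h)
    print(sum(fi - bi for fi, bi in zip(f, B)), discrete_energy(f, B, h))
```

The free ends of the discrete system play the role of the boundary conditions $f_T'(0) = f_T'(T) = 0$: the first equation gives $(f_1 - f_0)/h = h\,(f_0 - B_0)$, which vanishes as the step $h$ tends to zero.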
6 Concluding remark


Of course, one would like to handle the case of more or less general penalty functions $Q$. But in this case the equation replacing the one for the quadratic penalty is not linear, and we do not know whether we may proceed with some, maybe inexplicit, analogues of exponential functions. It would also be nice to guess a stationary approximation for least energy functions related to a general penalty $Q$.